Notes on Single-Pass Online Learning Algorithms

نویسندگان

  • Vitor R. Carvalho
  • William W. Cohen
چکیده

Online learning methods are typically faster and have a much smaller memory footprint than batch learning methods. However, in practice online learners frequently require multiple passes over the same training data in order to achieve accuracy comparable to batch learners. We investigate the problem of single-pass online learning, i.e., training only on a single pass over the data. We compare the performance of single-pass online learners to traditional batch learning, and we propose a new modification of the Margin Balanced Winnow algorithm that can reach results comparable to linear SVM for several NLP tasks. We also explore the effect of averaging, a.k.a. voting, on online classifiers. We provide experimental evidence that voting can be successfully used to boost the performance of several single-pass online learning algorithms. Finally, we describe how the Modified Margin Balanced Winnow algorithm proposed can be naturally adapted to perform online feature selection. This scheme performs comparably to information gain or chi-square, with the advantage of being able to select features on-the-fly.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Fuzzy Stabilizer Based on Online Learning Algorithm for Damping of Low-Frequency Oscillations

A multi objective Honey Bee Mating Optimization (HBMO) designed by online learning mechanism is proposed in this paper to optimize the double Fuzzy-Lead-Lag (FLL) stabilizer parameters in order to improve low-frequency oscillations in a multi machine power system. The proposed double FLL stabilizer consists of a low pass filter and two fuzzy logic controllers whose parameters can be set by the ...

متن کامل

The Effect of Online Learning Tools on L2 Reading Comprehension and Vocabulary Learning

The aim of this study was to investigate the effects of various online techniques (word reference, media, and vocabulary games) on reading comprehension as well as vocabulary comprehension and production. For this purpose, 60 language learners were selected and divided into three groups, and each group was randomly assigned to one of the treatment conditions. In the first session of tre...

متن کامل

Efficient Algorithm for Hierarchical Online Mining of Association Rules

-------------------------------------------------------------------------ABSTRACT---------------------------------------------------------------Several multi-pass algorithms have been proposed for Association Rule Mining from static repositories. However, such algorithms are incapable of online processing of transaction streams. In this paper we introduce an efficient single-pass algorithm for ...

متن کامل

Predicting Risk of Failure in Online Learning Platforms Using Machine Learning Algorithms for Modeling Students' Academic Performance

Online learning platforms such as Moodle and MOOC have become popular in higher education. These platforms provide information that are potentially useful in developing new student learning models and predicting outcomes, such as pass/fail and final grade prediction. Rather than grade book, another source of information provided by these platforms are in the form of metadata, namely data descri...

متن کامل

A Sparse Nonlinear Classifier Design Using AUC Optimization

AUC (Area under the ROC curve) is an important performance measure for applications where the data is highly imbalanced. Learning to maximize AUC performance is thus an important research problem. Using a max-margin based surrogate loss function, AUC optimization problem can be approximated as a pairwise rankSVM learning problem. Batch learning methods for solving the kernelized version of this...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006